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Estimation of benefit of prevention of occupational 
cancer for comparative risk assessment: methods 
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ABSTRACT 

Objectives To quantify the life years gained and financial 
savings by preventing a case of occupational cancer. 
Methods The authors retrieved data from the Taiwan 
Cancer Registry and linked them with the National 
Mortality Registry to estimate the survival functions for 
major occupational cancers: lung, pleural mesothelioma, 
urinary bladder and leukaemia. Assuming a constant 
excess hazard for each type of cancer, the authors 
extrapolated lifetime survival functions by the Monte 
Carlo method. For each patient with cancer, the authors 
simulated an age- and gender-matched person without 
cancer based on vital statistics of Taiwan to estimate life 
expectancy and expected years of life lost (EYLL). By 
using the reimbursement data from the National Health 
Insurance Research Database, the authors calculated the 
average monthly healthcare expenditures, which were 
summed to estimate the lifetime healthcare expenditures 
after adjusting for the corresponding monthly survival 
probability. 

Results A total of 51 408, 136, 12 891 and 5285 new 
cases of lung, pleural mesothelioma, bladder and 
leukaemia cancers, respectively, were identified during 
1997-2005 and followed until the end of 2007. The 
EYLL was predicted to be 13.7±0.1, 18.9±0.7, 
4.7±0.3 and 19.4±0.5 years for these cancers, 
respectively, and the lifetime healthcare expenditures 
with a 3% annual discount were predicted to be US 
$22 359, US$14900, US$51 987 and US$59 741, 
respectively. 

Conclusions The burden of these occupational cancers, 
in terms of EYLL and lifetime healthcare expenditures, 
was substantial. Such estimates may provide useful 
empirical evidence for comparative risk assessment that 
can be applied in health policy-making and clinical 
decision-making. 



INTRODUCTION 

Cancer is a major public health issue because of the 
associated reduction in quantity and quality of life 
in patients and financial burden placed on patients, 
their families and the society. In the study of the 
aetiology of human cancer, research in occupational 
and environmental epidemiology has identified risk 
factors of cancer that may be of practical implica- 
tions for prevention. A WHO study found that 
approximately 19% (12%— 29%) of all cancers were 
estimated to be attributable to the environment, 
resulting in 1.3 million deaths each year world- 
wide. 1 The International Agency for Research on 
Cancer has evaluated over 900 potential carcino- 



What this paper adds 



► Population-based information is limited on the 
quantified health benefits associated with the 
prevention of occupational cancer. 

► Our study illustrated a practical approach to 
estimate EYLL and lifetime healthcare expendi- 
tures for major occupational cancers, using the 
cancer registry and National Health Insurance 
administrative databases. 

► The burden of these occupational cancers was 
substantial in terms of EYLL and lifetime 
healthcare expenditures. Such estimates may 
provide useful empirical evidence for compara- 
tive risk assessment used in health policy- 
making and clinical decision-making. 



gens in the past 4 decades, documenting the 
workplace as a major source of exposure. Occupa- 
tional cancers are potentially preventable, 2 and the 
burden of occupational cancer has been estimated 
in terms of attributable fractions, in the range of 
2%— 8%. 3 ~ 5 These estimates imply that a substan- 
tial percentage of cancer occurrences could be 
possibly prevented. 

Among various definitions of risks, 6 the British 
Standard 18004:2008 describes it as composed of 
two factors: likelihood of a hazardous event or 
exposure (s) and the consequence or severity of 
injury or ill health that can be caused by the event 
or exposure(s). 7 It provides us a basis to quantify 
the impact of risk by taking account of both like- 
lihood and consequence of the event. 8 Nevertheless, 
the consequences or outcomes generally cannot be 
directly compared. To have policy implications on 
the potential impact, a common unit of measure- 
ment of outcomes and benefits is needed. Ideally, 
the valuation of the human health benefits gained 
from environmental regulations or public health 
programmes would include all costs to society, 
including medical costs, work-related costs, educa- 
tional costs, the cost of support services required by 
medical conditions and/or the willingness of indi- 
viduals to pay to avoid the health risks. Direct 
medical costs, as suggested by the US Environ- 
mental Protection Agency, often provide a lower 
bound estimate of the costs of illness. 9 For 
comparative risk assessment, we proposed the use 
of life years or quality-adjusted life years 8 gained 
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and financial savings on healthcare expenditures as the metric of 
outcome evaluation for comparison with other benefits gained 
through policy investment. 

However, there is a general lack of population-based infor- 
mation on the quantification of health benefits associated with 
the prevention of occupational cancer. In addition, the investi- 
gation of overall medical resource utilisation and economic 
burden of cancer at a national level is still limited. 10-13 The 
objective of this study is to quantify the life years gained and the 
potential financial savings from preventing a case of occupa- 
tional cancer. 

METHODS 

This study began after the approval of the Institutional Review 
Board of the National Taiwan University Hospital (NTUH IRB 
no. 200808029R). We used two different datasets to cross-vali- 
date the estimation of the survival functions in this study: the 
Taiwan Cancer Registry (TCR) and the National Health Insur- 
ance Research Database (NHIRD). 

Taiwan Cancer Registry 

We retrieved data from the TCR and linked them with the 
National Mortality Registry to estimate the survival functions 
for major occupational cancer. The cancer sites of interest in this 
study included lung (ICD-9-CM code: 162), pleural mesothe- 
lioma (ICD-9-CM code: 163), urinary bladder (ICD-9-CM code: 
188) and all leukaemia cases other than chronic lymphocytic 
leukaemia (ICD-9-CM codes: 204.0, 205.0, 205.1). Only patients 
aged 18 years or older with histopathological evidence of the 
disease were included in this study. A total of 69 720 patients 
were identified with the diagnoses of four major occupational 
cancers from 1 July 1997 to 31 December 2005. The dataset was 
then linked with the National Mortality Registry database to 
verify the vital status for each case until the end of 2007. 

National Health Insurance Research Database 

The reimbursement data of the NHIRD from 1997 to 2007, 
which contained data for all outpatients and inpatients with 
diagnoses involving the major occupational cancers, were also 
utilised. Taiwan established a single-payer National Health 
Insurance (NHI) programme in 1995. At the end of 2008, the 
NHI covered 99.0% of Taiwan's population of 23 million. The 
NHIRD consists of original claims data for reimbursement and 
a registry of all enrolees, which allowed researchers to trace all 
medical services received by enrolees under the NHI programme. 
To secure the protection of personal confidentiality, all the 
personal identification numbers were encrypted, and the reim- 
bursement data were transformed into a research database, 
namely the NHIRD, which were regularly maintained by the 
National Health Research Institutes. Taiwan's NHI covers 
almost all healthcare services for catastrophic illnesses, except 
some novel technologies and medications for which there is no 
evidenced-based consensus on their effectiveness. As all types of 
cancer can be registered as a catastrophic illness and treated 
without copayments, the criteria for the registration were very 
strict and established by the medical specialty boards and the 
Bureau of NHI. Therefore, we only included the four types of 
cancer cases (as described above) that were successfully regis- 
tered in the dataset of catastrophic illnesses from 1 July 1997 to 
31 December 2007. To ensure that all cases were incident cases 
during the study period, we excluded the potentially prevalent 
cases that were admitted in any hospital with a diagnosis of any 
of the four types of cancer before 1 July 1997. 



Extrapolation method to obtain lifetime survival functions 

Monte Carlo simulation was used to extrapolate survival for up 
to 50 years to derive the lifetime survival function after the 
diagnosis of each cancer. Briefly, the survival function for an age- 
and gender-matched reference population was generated using 
the Monte Carlo method from the life tables of the general 
population of Taiwan in the corresponding year. The lifetime 
survival of the patients with cancer (up to 50 years) was 
obtained using linear extrapolation of a logit-transformed curve 
of the survival ratio between the cancer cohort and reference 
population, under the assumption of a constant excess hazard 
model. 14 15 The expected years of life lost (EYLL) for a specific 
disease was defined as the lifetime survival difference between 
the disease cohort and an age- and gender-matched reference 
population. We estimated the average EYLL by calculating the 
difference in the areas under the long-term survival curves 
between the cohort of cancer patients and the age- and gender- 
matched reference population. To facilitate the estimation, we 
used the ISQoL software, which was built in the R statistical 
package and can be freely downloaded from http://www.stat. 
sinica.edu.tw/jshwang. 

Measurement of lifetime costs paid by the NHI 

We used the NHIRD data to establish the cohorts of incident 
cancer patients for the estimation of the lifetime healthcare 
expenditures from the perspective of the NHI. The total lifetime 
cost of a patient refers to all the direct healthcare expenditures 
paid by the NHI from the date of cancer diagnosis until the date 
of death. Patients that were alive at the end of 2007 were 
censored. The dates of death were retrieved from hospitalisation 
files and the catastrophic illness registration files. By retrieving 
the reimbursement data from the NHIRD, we were able to 
calculate the average healthcare expenditures spent by the 
patients with different types of cancer at time t, which can be 
summed for the lifetime after adjusting for the corresponding 
survival probability at time t. The effective sample size in each 
month in the follow-up period was applied for the calculation of 
the average monthly healthcare expenditure using the SAS 
software. The lifetime healthcare expenditure per case was 
estimated by multiplying the monthly mean survival probability 
with the corresponding average monthly healthcare expenditure, 
adjusting for the annual discount rate. 

Validation of the Monte Carlo extrapolation and comparison with 
the Kaplan— Meier method 

Empirical data from the TCR and NHIRD provided an oppor- 
tunity to cross-validate the estimation of survival functions and 
the actual performance of the Monte Carlo extrapolation. We 
first included a subcohort of patients diagnosed as having the 
cancers of interest between 1 July 1997 and 31 December 2001. 
The subcohorts were only followed until the end of 2001 or for 
a period of 4.5 years. Then, we extrapolated the data through 
the end of 2007 or for an additional period of 6 years, using 
the Monte Carlo method to estimate mean survival months. 
For the cohorts that were followed to the end of 2007, the 
Kaplan— Meier method was applied to calculate the mean 
survival month based on a follow-up of 10.5 years as the 'gold 
standard' for comparison. We presented the relative biases for 
each cancer to show the differences between the Kaplan— Meier 
estimates and those of the Monte Carlo extrapolation method 
for both the TCR and the NHIRD. Because of the limited 
sample size (N=136) of pleural mesotheliomas, we only 
performed validation for lung cancer, bladder cancer and 
leukaemia patients. 
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Table 1 Characteristics of cancer cohorts and estimated lifetime healthcare expenditures based on TCR (July 1997—2005) and NHIRD during 

10.5 years of follow-up (July 1997 to December 2007) 

Lung cancer Mesothelioma Bladder cancer Leukaemia 

Characteristics TCR NHIRD TCR NHIRD TCR NHIRD TCR NHIRD 



Number 


51 408 


68 926 


136 


428 


12891 


20128 


5285 


4967 


Age (years), mean±SD (range) 


67.7±12.0 


67.6±12.2 


60.0±14.9 


64.3±14.9 


67.6± 1 2.6 


68.0±12.6 


53.7±19.4 


54.6±19.0 




(18-101) 


(18-103) 


(20-97) 


(22-97) 


(18-106) 


(19-105) 


(18-97) 


(18-100) 


Median survival (month) 


8.1 


8.3 


6.0 


8.2 


93.5 


101.1 


12.8 


18.7 


Male, % 


68.6 


66.5 


71.3 


63.8 


72.4 


71.6 


58.6 


58.4 


Life expectancy (years) 


2.4±0.1 


2.4±0.1 


2.5±0.7 


3.5±0.7 


11.3±0.3 


11.6±0.3 


6.9±0.5 


7.6±0.7 


EYLL 


13.7±0.1 


13.8±0.1 


18.9±0.7 


15.2±0.7 


4.7±0.3 


4.3±0.3 


19.4+0.5 


18.3±0.7 


Lifetime healthcare expenditures (US$), 


mean±SE 
















Discount 3% 


















After cancer diagnosis 


22 359 ±2033 


22 483 ±2094 


14 900±2844 


18856±2720 


51 987±3022 


53 347±3363 


59 741 ±4518 


68 275±6326 


Included duration 3 months before 


22 667±1971 


22 791 ±2033 


15 209±2844 


19165±2720 


52 553±3098 


53 876±3363 


60 042±4418 


68577±6225 


diagnosis 


















Discount 5% 


















After cancer diagnosis 


22 051 ±2033 


22 236±2033 


14776±2535 


18609±2782 


49 493 ±2947 


50 778±3136 


56 729±4016 


65 765±4719 


Included duration 3 months before 


22 359±1971 


22483±1971 


15 023±2535 


18918±2720 


50 022±2947 


51 307±3136 


57 030±4016 


66 067±4619 



diagnosis 



1US$ =29.322 New Taiwan Dollar in 2010. 

EYLL, expected years of life lost; NHIRD, National Health Insurance Research Database; TCR, Taiwan Cancer Registry. 



Statistical analysis 

The statistical analyses were performed using the SAS software, 
V.9.2 (SAS Institute Inc.). To apply Monte Carlo simulation, we 
used the ISQoL software (available from http://www.stat. sinica. 
edu.tw/jshwang) for the estimation of lifetime survival func- 
tions and the calculation of EYLL and lifetime healthcare 
expenditures. We also performed sensitivity analyses considering 
an annual discount rate of 3% and 5% and with a disease duration 
that included the 3 months before the date of cancer diagnosis. 

RESULTS 

The descriptive characteristics of the four cancer cohorts are 
summarised in table 1. The mean age for leukaemia patients is 
the youngest, which generally occurred in the early 50s, 
compared with the other three types of cancer, which generally 
occur at an average age of early to late 60s. As the median 
survival for lung cancer and mesothelioma are generally <1 year 
and their life expectancies are usually <4 years, these two types 
of cancer have a high EYLL. Leukaemia also shows a similar 
trend. Therefore, the successful prevention for these three types 
of cancer would save 13—19 life years per case plus the estimated 
lifetime healthcare expenditures. For lung cancer, the estimates 
of median survival, EYLL and lifetime healthcare expenditures 
based on the TCR data were comparable with those based on 
the NHIRD data (table 1). The cohort of pleural mesothelioma is 
only composed of 136 patients in the TCR, likely due to the 
difficulty in diagnosing this cancer solely based on pathology. 
The number recorded in the NHIRD during the same period was 
428. All the estimates were slightly different, although they 



showed a similar trend. For example, the mean (±SE) lifetime 
healthcare expenditures based on the TCR after cancer diagnosis 
was US$15 703 (±2411) at an annual discount rate of 3%, while 
the mean based on the NHIRD was US$19598 (±2720). The 
patients with bladder cancer had the highest life expectancy of 
11.3 years, and their lifetime healthcare expenditures generally 
exceeded US$50 000. Prevention of leukaemia would save an 
EYLL of 19.4 years per case and a lifetime healthcare expendi- 
tures of approximately US$60 000-USS70 000, based on this 
study. The sensitivity analyses with different discount rates and 
disease costs, either after cancer diagnosis or with a disease 
duration that included 3 months prior to diagnosis, did not have 
a large effect except for urinary bladder cancer, which has a life 
expectancy of 11.3— 11. 6 years. 

The survivals based on the subcohorts of cancer patients 
established in the 4.5-year period were extrapolated to an addi- 
tional 6 years using the Monte Carlo method and then compared 
with actual survival months calculated by the Kaplan— Meier 
method that used the complete 10.5-year follow-up from July 
1997 to December 2007. The relative biases for the two methods 
are summarised in table 2. The relative biases for the Monte 
Carlo method were generally below 5% except for leukaemia, 
which ranged between —10.64% and —12.89%. The differences 
of relative biases between the TCR and NHIRD for the same 
cohort and time period were <2.5%, indicating a high agreement 
between the two datasets. The Monte Carlo method tended to 
underestimate the long-term survival compared with the 
Kaplan— Meier method, and the largest relative biases were 
consistently found in the leukaemia patients. 



Table 2 Estimates of mean survival months in 1 0.5 years of follow-up using the Monte Carlo method based on the first 4.5 years of follow-up data for 
the TCR and the NHIRD were compared with the Kaplan— Meier estimates based on 10.5 years of follow-up (July 1997 to December 2007) 



TCR 



NHIRD 



10.5-year follow-up 



Extrapolation based on the first 4.5-year 
follow-up 

Monte Carlo method 



10.5-year follow-up 



Extrapolation based on the first 4.5-year 
follow-up 

Monte Carlo method 



Cancer site 


Mean 


Mean 


SE 


Relative bias (%) 


Mean 


Mean 


SE 


Relative bias (%) 


Lung 


21.14 


20.07 


0.4 


-5.06 


23.87 


23.45 


0.56 


-1.76 


Bladder 


77.81 


76.36 


1.45 


-1.86 


82.08 


81.70 


1.54 


-0.47 


Leukaemia 


38.04 


33.14 


1.79 


-12.89 


52.15 


46.6 


2.63 


-10.64 



NHIRD, National Health Insurance Research Database; TCR, Taiwan Cancer Registry. 
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DISCUSSION 

Our study illustrated a practical approach to estimate EYLL and 
lifetime healthcare expenditures for occupational cancers using 
data from administrative databases. A particular strength of this 
study is the use of two separate national population-based 
databases, the TCR and NHIRD, with adequate sample sizes of 
cancer patients and a long follow-up period (July 1997 to 
December 2007). This enabled us to cross-validate the estimates. 
All life expectancy estimates for the cancers from the two 
databases were relatively close or within a 1-year difference, 
implying the validity of datasets and the consistency of esti- 
mation method. However, there was a difference of EYLL for 
mesothelioma estimated from the TCR and NHIRD 
(18.9— 15. 2=3. 7 years). The difference is likely attributed to the 
different mean ages at diagnosis, that is, 64.3—60.0=4.3 years. 
The cancer patients included in the TCR dataset were those 
with histopathological evidence of disease and were approxi- 
mately 4 years younger than those registered in the NHIRD, 
which used clinical data only. Our estimate of lifetime cost to 
the NHI also corroborates with another study using claims data 
from a single tertiary care medical centre in Taiwan during 
1999—2002; the undiscounted average cost up to 10 years for 
lung cancer was NTS 448 371 (1US$ = NTS 33 in 2002), which 
is equivalent to USI16703 in 2010. 10 Thus, we tentatively 
concluded that the two datasets, TCR and NHIRD, and the 
method used in this study are valid. The resulting estimates may 
be useful in future health policy decisions, especially regarding 
cancer prevention programmes. 

Based on the results that were estimated from histopatho- 
logically diagnosed cancer cases abstracted from the TCR 
dataset, we found that the successful prevention of a case of 
lung cancer, pleural mesothelioma, bladder cancer and leukaemia 
would save US$22359, US$14 900, US$51987 and US$59 741, 
respectively. If a prevention programme costs more than it were 
estimated to save, the residual money can be divided by the 
number of life years saved to obtain an estimate of cost-effec- 
tiveness or cost per life year gained, which can be directly 
compared with different clinical treatments, rehabilitation, etc. 
Moreover, the establishment of national occupational standards 
for carcinogens and strict enforcement of these standards may 
provide following additional benefits: the savings of human 
capital loss (indirect costs) due to the occurrence of cancer, the 
health benefits of preventing non-cancer outcomes related to the 
same exposures and the avoidance of human suffering among 
cancer patients and their families. 16 

The lifetime healthcare expenditures of the patients with 
bladder cancer were much greater than those with mesothe- 
lioma or lung cancer because these patients generally had 
a longer life expectancy than the others. Avritscher et al 17 at the 
University of Texas MD Anderson Cancer Centre estimated that 
the average cost for bladder cancer was US$65 158, and the 
predicted lifetime costs for patients averaged US$120 684 for the 
best-case scenario and US$99 270 for the worst-case scenario. 
Longer survival is generally associated with a longer period of 
surveillance, a higher likelihood of treatment of recurrent 
tumours and a higher cost of treating with possible complica- 
tions. The mean estimates of lifetime healthcare expenditures 
have their variance that is expressed in SE, as shown in the table 
1. Because our data did not include the details of sources of 
variance, 9 we simply show figure 1 to illustrate the temporal 
changes of lifetime healthcare expenditures, using lung cancer as 
an example, which shows that the cost is generally higher at the 
time of diagnosis and initial treatment, stabilises after 6—12 



Lifetime direct mean costs due to lung cancer 



100 000 i 



\ Survival curve of lung 




40 60 80 
Months after diagnosis 

Figure 1 The dynamic fluctuation of mean monthly healthcare 
expenditures reimbursed by the National Health Insurance beginning at 
the time of cancer diagnosis, as well as their 95% confidence limits. The 
shaded area under the curve is the lifetime healthcare expenditures for 
an average case of lung cancer patient. 

months and then is slightly elevated near the end of life. With 
the advanced development of new technologies, medications 
and therapies in cancer treatment, the survival of cancer patients 
will be improved and healthcare expenditures will increase. 
Thus, the current estimates may be lower than the true value of 
the healthcare expenditure paid by the NHI in the future. 

Limitations 

This study has the following limitations that should be 
addressed. First, we adopted the insurer's perspective, and only 
direct medical costs were estimated in this study. Based on the 
estimates of US National Institutes of Health, the 2007 overall 
annual costs of cancer were US$226.8 billion; all healthcare 
expenditures were US$103.8 billion, and indirect morbidity costs 
and mortality costs accounted for more than 50% of the total 
costs of cancer. 18 Due to the lack of empirical data on the costs 
of lost productivity due to illness or premature death, our results 
underestimate the cost of illness to the whole society. Because 
the estimation of cost of illness is generally a lower bound of 
that of applying the willingness-to-pay method, it implies that 
our results leave the true figure to some extent. 19 Moreover, 
since the average healthcare expenditure is much lower 
compared with all the developed countries, 20 it requires appro- 
priate adjustments for the results to be transferable to other 
countries or jurisdictions. 21 However, since we have quantified 
the EYLL, one may also use these estimates for calculating the 
cost-effectiveness ratio for a prevention programme that is able 
to provide the expected number of prevented cancer cases. 

Second, because the data from the NHIRD were collected 
primarily for health insurance administration use, we are 
concerned that the follow-up of patient survival status may not 
be comprehensive. Using the TCR data cross-linked with the 
National Mortality Registry database as the gold standard, we 
have validated that there is relatively little bias in calculating the 
survival functions based on the NHIRD data (table 2). 

Third, the data on disease severity, including cancer staging, 
and a patient's socioeconomic status were not available in both 
databases; these factors may influence the prognosis of cancer 
and lifetime cost estimation. Future studies are needed to 
address the potential effects of these factors, including estab- 
lishment of the NHI on survival and life expectancy for 
communities with different socioeconomic statuses. 

Fourth, there were no available data regarding the occupa- 
tional exposure to carcinogens in the databases that were used in 
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this study. Cancers caused by occupational agents tend to affect 
younger individuals, especially if the initial exposure to the 
carcinogen occurs early in their working life. 22 In addition to the 
life shortening effect directly from work-related exposure to 
carcinogens, potential interactions between occupational factors 
and personal lifestyles may also contribute to additional life 
shortening effect, such as synergistic interaction between 
smoking and asbestos exposure in Americans 23 and Chinese 
workers. 24 Therefore, EYLL and the cost in occupational cancer 
will be higher than non-occupational-related cancers. The esti- 
mates, based on all cases of cancer, are more likely an underes- 
timation of the prevention benefits of occupational cancers. 

CONCLUSIONS 

In conclusion, we have proposed a feasible approach for 
measuring EYLL, as well as the lifetime healthcare expenditures 
for occupational cancers, based on the analyses of population- 
based administrative databases. The same method could be 
applied in future studies to stratify cancer patients according to 
their cancer stages and evaluate the health benefits of early 
detection of cancer. This study has provided empirical evidence 
that the burden from these occupational cancers on EYLL and 
lifetime healthcare expenditures was substantial, which warrants 
the attention of policy makers and highlights the importance of 
investing more resources in prevention programmes. 
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